Clustering of Correlated Documents into Designated Number of Clusters: A Practical Approach
نویسنده
چکیده
We consider a complete graph to cluster the vertices into k-clusters, where each edge of the graph is labeled either as “+” or “–”. “+” denotes that vertices incident to the edge are mutually related and “–” edge denotes that the vertices incident to the edge are mutually unrelated. The goal of the clustering is to place vertices into k clusters, where documents are clustered with maximally related items. That is, clustering should maximize the agreements (“+” edges inside the clusters and “–” edges between the clusters), or equivalently minimizes the disagreements (“–” edges inside the clusters and “+” edges between the clusters). We give a simple algorithm for the maximizing the agreements and test the success of the algorithm. We compare approach with Bansal et. al’s approach proposed in [1]., and conclude that complicated algorithms that have exponential run time are not practical.
منابع مشابه
Data Clustring Using A New CGA(Chaotic-Generic Algorithm) Approach
Clustering is the process of dividing a set of input data into a number of subgroups. The members of each subgroup are similar to each other but different from members of other subgroups. The genetic algorithm has enjoyed many applications in clustering data. One of these applications is the clustering of images. The problem with the earlier methods used in clustering images was in selecting in...
متن کاملData Clustring Using A New CGA(Chaotic-Generic Algorithm) Approach
Clustering is the process of dividing a set of input data into a number of subgroups. The members of each subgroup are similar to each other but different from members of other subgroups. The genetic algorithm has enjoyed many applications in clustering data. One of these applications is the clustering of images. The problem with the earlier methods used in clustering images was in selecting in...
متن کاملComparison of Strategic Plans of Universities and Institutes of Higher Education with a Quantitative Approach
Strategic planning in Iranian universities and institutes of higher education is generally prepared using strategic planning models introduced by experts and other universities. These programs will be published in the form of university strategic planning documents. These documents have such features that can be similar or different than the programming templates used. Existence of the similar...
متن کاملNew Approach for Customer Clustering by Integrating the LRFM Model and Fuzzy Inference System
This study aimed at providing a systematic method to analyze the characteristics of customers’ purchasing behavior in order to improve the performance of customer relationship management system. For this purpose, the improved model of LRFM (including Length, Recency, Frequency, and Monetary indices) was utilized which is now a more common model than the basic RFM model apt for analyzing the cus...
متن کاملA clustering approach for mineral potential mapping: A deposit-scale porphyry copper exploration targeting
This work describes a knowledge-guided clustering approach for mineral potential mapping (MPM), by which the optimum number of clusters is derived form a knowledge-driven methodology through a concentration-area (C-A) multifractal analysis. To implement the proposed approach, a case study at the North Narbaghi region in the Saveh, Markazi province of Iran, was investigated to discover porphyry ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005